Integrating Syntagmatic Information in a Dictionary for Computer Speech Applications

نویسنده

  • Dieter Huber
چکیده

Conventional dictionaries, albeit they often comprise an impressive amount o f para­ digmatic information on various aspects o f linguistic description, usually pay only little attention to the representation o f syntagmatic information. Admittedly, apart hrom spelling conventions and rules o f inflectional agreement, the co-occurrence o f indivi­ dual lexical items will not normally change the orthographic shape o f a word when it appears in written text. In spoken language, however, the phonetic realization o f words is heavily influenced by context and may change dramatically in a variety o f ways, including segmental as well as prosodic features. These changes need to be taken into account in both computer speech synthesis and automatic qreech recognition. In this paper, dierefore, we argue fathe inclusion of syntagmatic information in dictionaries which are developed for the special purpose o f spdcen language processing in computer speech applications. Two kinds of syntagmatic information will be considered in more detail: Case Frames and Collocations.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech Enhancement using Adaptive Data-Based Dictionary Learning

In this paper, a speech enhancement method based on sparse representation of data frames has been presented. Speech enhancement is one of the most applicable areas in different signal processing fields. The objective of a speech enhancement system is improvement of either intelligibility or quality of the speech signals. This process is carried out using the speech signal processing techniques ...

متن کامل

A new Dictionary of Swedish Pronunciation

This paper describes some aspects of a pronunciation dictionary for Swedish, "Svenskt Utlalslexikon" (SUL), which is piesenUy developed at our departm ent This dictionary provides, among other items, three kinds of information about Swedish pronunciation that are not included in standard dictionaries: information on varian ts , on inflected form s and com pounds, and on p ro p er names. SUL is ...

متن کامل

Automatic Construction of Persian ICT WordNet using Princeton WordNet

WordNet is a large lexical database of English language, in which, nouns, verbs, adjectives, and adverbs are grouped into sets of cognitive synonyms (synsets). Each synset expresses a distinct concept. Synsets are interlinked by both semantic and lexical relations. WordNet is essentially used for word sense disambiguation, information retrieval, and text translation. In this paper, we propose s...

متن کامل

A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain

Quality of speech signal significantly reduces in the presence of environmental noise signals and leads to the imperfect performance of hearing aid devices, automatic speech recognition systems, and mobile phones. In this paper, the single channel speech enhancement of the corrupted signals by the additive noise signals is considered. A dictionary-based algorithm is proposed to train the speech...

متن کامل

Concept-to-speech generation by integrating syntagmatic features into HMM-based speech synthesis

In conventional concept-to-speech (CTS) methods, a common step is predicting abstract prosodic descriptions, such as the locations of accents and phrase boundaries, from the linguistic information provided by the text generation module. But the prediction results always contain errors, and unacceptable prosodic prediction may ruin the synthesized speech. In addition, linguistic information, whi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1991